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Abstract 

Background and Objectives: Hepatitis C virus (HCV) variants tliat confer resistance to direct-acting-antiviral agents (DAA) 
have been detected by standard sequencing technology in genotype (G) 1 viruses from DAA-naive patients. It has recently 
been shown that virological response rates are higher and breakthrough rates are lower in Gib infected patients than in 
Gla infected patients treated with certain classes of HCV DAAs. It is not known whether this corresponds to a difference in 
the composition of Gla and Gl b HCV quasispecies in regards to the proportion of naturally occurring DAA-resistant variants 
before treatment. 

Methods: We used ultradeep pyrosequencing to determine the prevalence of low-abundance {<25% of the sequence 
reads) DAA-resistant variants in 191 NS3 and 116 NS5B isolates from 208 DAA-naive Gl-infected patients. 

Results: A total of 3.5 million high-quality reads of >200 nucleotides were generated. The median coverage depth was 
4150x and 4470x per NS3 and NS5B amplicon, respectively. Both Gla and Gib populations showed Shannon entropy 
distributions, with no difference between Gla and Gib in NS3 or NS5B region at the nucleotide level. A higher number of 
substitutions that confer resistance to protease inhibitors were observed in Gla isolates (mainly at amino acid 80 of the NS3 
region). The prevalence of amino acid substitutions that confer resistance to NS5B non-nucleoside inhibitors was similar in 
Gla and Gib isolates. The NS5B S282T variant, which confers resistance to the polymerase inhibitors mericitabine and 
sofosbuvir, was not detected in any sample. 

Conclusion:lhe quasispecies genetic diversity and prevalence of DAA-resistant variants was similar in Gl a and Gl b isolates 
and in both NS3 and NS5B regions, suggesting that this is not a determinant for the higher level of DAA resistance observed 
across Gla HCV infected patients upon treatment. 
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Introduction 

Advances in t±ie knowledge of the .structure and function of 
hepatitis C virus (HCV) proteins and the development of robust 



methods for studying HCV replication in vitro have resulted in the 
development of direct acting antiviral agents (DAAs) that target 
essential proteins, primarily the NS3/4A serine protease, the RNA 
dependent RNA polymerase (RdRp, NS5B) and NS5A. Three 
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agents that inhibit the NS3/4A serine protease are now approved 
for clinical use (telaprevir, boceprevir and srmeprevir). These 
protease inhibitors (Pis) are potent inhibitors of HCV replication 
in vivo; however, resistance develops quickly even when these 
drugs are administered in combination with peginterferon/ 
ribavirin. [1-3] Indeed, amino acid substitutions within the NS3 
protease region have been identified in vitro and in vivo and 
correlated with reduced susceptibility to all Pis, including those in 
development (Table 1). 

Several NS5B polymerase nucleoside analog inhibitors (NIs) 
have shown clinical efficacy in combination with peginterferon/ 
ribavirin, and in interferon-free regimens. These NIs include 
sofosbuvir, now approved for the treatment of HCV infection [4] 
(GS7977, a phosphoramidate prodrug of 2'-deoxy-2'-0(-fluoro-P- 
C-methyluridine-5'-monophosphate), mericitabine (RG7128, pro- 
drug of P-D-2'-deoxy-2'-fluoro-2'-C-methyl cytidrne), [5] and the 
guanosine-based NIs, now stopped for safety reasons (GS-938, 
IDX-184 and INX-189). Sofosbuvir and mericitabine treatment 
rarely select for resistant variants, identified as carrying substitu- 
tions S282T and/or L159F/L320F in NS5B. [6-9] There are also 
several investigational non-nucleoside NS5B inhibitors (NNIs) 
including ABT-072, ABT 333 and setrobuvir which bind to the 
Palm I allosteric domain, [10,11] BMS-791325 [12] and 
BI207127 which bind to the Thumb I site [13], and VX-222 
and filibuvir which bind to the Thumb II site. [14,15] In contrast 
to NTresistant NS5B variants, NNI-resistant variants emerge 
rapidly in vitro and in vivo. [11,15-18] However, NNI-resistant 
variants that confer resistance to inhibitors that bind to one 
allosteric site do not confer cross-resistance to inhibitors that bind 
to a different allosteric site (Table 2). 

HCV NS5B RdRp, lacking a proof reading function, mis- 
incorporates nucleotides at a rate of 1 per 10 000 bases copied. 
[19] The low fidelity of replication is compounded by a high 
replication rate that can result in the production of up to lO'^ 
virions per day. [20] As a result, HCV exists in any given patient 
as a diverse collection of closely related variants termed 
quasispecies. [21] Although the original definition of a quasispecies 



requires an effectively infinite population size, population genet- 
icists have nonetheless found quasispecies theory to be useful for 
finite viral populations with high mutation rates, and have 
generally accepted the use of the term quasispecies when applied 
to HCV and several other viral infections [22]. 

Recent studies on protease inhibitors telaprevir and boceprevir 
have shown higher virological response rates, and lower break- 
through rates, in Gib-infected patients than in G la-infected 
patients. [1,2,23,24] Similar results have been described for 
patients enrolled in Palm I NNI-based interferon-free regimen. 
[25,26] However, it is not known whether this corresponds to a 
different distribution of variants in Gla and Gib quasispecies in 
regards to the proportion that contain naturally occurring DAA- 
resistance mutations before treatment; it is also unknown what the 
possible imphcations for the response to DAA treatment could be. 

In this study, we sought to investigate the overall genetic 
diversity and the potential presence of low-abundant DAA- 
resistant variants (<25% of the sequence reads) in the NS3 and 
NS5B genes of HCV Gla and Gib viruses in isolates from >200 
DAA-naive patients using ultradeep pyrosequencing (UDPS) 
methodology. We characterized, on a quantitative scale, the 
abundance (defined as the frequency of a particular amino acid 
substitution in an isolate's quasispecies as low as 0.5%) and 
prevalence (defined as the proportion of patients which carries a 
particular amino acid substitution) of more than 30 established 
DAA-resistant substitutions and polymorphisms in the NS3 and 
NS5B regions. We also compared the genetic variability of Gla 
and Gib isolates to determine whether differences in variability 
might be responsible for the lower virological responses to HCV 
Pis and Palm I NNI -DAA-containing therapy in patients infected 
with Gla compared with Gib viruses. 

Materials and Methods 

Clinical isolates 

Samples from 208 Gl DAA-naive patients, most of whom were 
enrolled in Roche-sponsored global clinical trials (PV 18369, 



Table 1. List of known in vitro and in vivo amino acid substitutions resistant to NS3 protease inhibitors used in this study (NS3 
residues 31 to 175). 





NS3 amino 
acid position 


WT residue 




Resistance 
substitutions 


Protease Inhibitor* 




G1a 


G1b 






36 


V 


V 


A,I,L,M 


tinear (Boceprevir/Telaprevir) 


41 


Q 


Q 


R 


tinear (Boceprevir); Macrocyclic (Danoprevir/Simeprevir) 


43 


F 


F 


C,I,S,V 


tinear (Boceprevir); Macrocyclic (Danoprevir/Simeprevir) 


54 


T 


T 


A,C,G,S 


tinear (Boceprevir/Telaprevir) 


55 


V 


V 


A,l 


tinear (Boceprevir) 


SO 


Q,K 


Q,L 


H,K,R 


Macrocyclic (Danoprevir/Simeprevir) 


138 


S 


S 


T 


Macrocyclic (Danoprevir) 


155 


R 


R 


G,I,K,M,Q,S,T 


tinear (Boceprevir/Telaprevir); Macrocyclic (Danoprevir/Simeprevir) 


156 


A 


A 


G,S,T,V 


tinear (Boceprevir/Telaprevir); Macrocyclic (Danoprevir/Simeprevir) 


158 


V 


V 


1 


tinear (Boceprevir) 


168 


D 


D 


A,E,G,H,I,N,T,Y,V 


Macrocyclic (Simeprevir/Danoprevir) 


170 


1 


V,i 


A,F,T 


tinear (Boceprevir) 


175 


t 


M 


t 


tinear (Boceprevir) 



*For conciseness, 2 representatives from each PI class were chosen [1-3,35,37,50-53]. 
doi:1 0.1 371/journal.pone.01 05569.t001 
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Table 2. List of known in vitro and in vivo amino acid substitutions resistant to NS5B polymerase inhibitors used in this study 
(NS5B residues 244-496). 





NS5B amino 
acid position 


WT residue 




Resistance 
substitutions 


Polymerase Inhibitor* 




Gla 


Gib 






282 


S 


S 


T 


Nucleoside Inhibitor (Mericitabine, Sofosbuvir) 


316 


c 


C,N 


N,Y 


NNI/Palm 1 (ABT-072/ABT-333, Setrobuvir, Tegobuvir) 


320 


L 


L 


1, F 


Nucleoside Inhibitor (GS-938, Mericitabine)** 


321 


V 


V 


1 


Nucleoside Inhibitor (GS-938)** 


392 


L 


L 


1 


NNI/Palm 1 (ABT-072/ABT-333) 


411 


N 


N 


S 


NNI/Palm 1 (ABT-072/ABT-333) 


414 


M 


M 


l,L,T,V 


NNI/Palm 1 (ABT-072/ABT-333, Setrobuvir) 


419 


L 


L 


l,M 


NNI/Thumb II (Filibuvir, VX-222) 


421 


A 


A 


V 


NNI/Thumb 1 (BMS-791325) 


422 


R 


R 


K 


NNI/Thumb II (Filibuvir, VX-222) 


423 


M 


M 


A,I,T,V 


NNI/Thumb II (Filibuvir, VX-222) 


426 


M 


M 


A,T 


NNI/Thumb II (Filibuvir) 


445 


C 


C 


F,Y 


NNI/Palm 1 (Setrobuvir, Tegobuvir) 


448 


Y 


Y 


H, C 


NNI/Palm 1 (ABT-072/ABT-333, Setrobuvir, Tegobuvir) 


451 


C 


C 


R 


NNI/Palm 1 (Setrobuvir) 


452 


Y 


Y 


H 


NNI/Palm 1 (Tegobuvir) 


482 


1 


1 


L,S,T 


NNI/Thumb II (Filibuvir, VX-222) 


494 


V 


V 


A,I,T 


NNI/Thumb II (Filibuvir) 


495 


P 


P 


A,L,S 


NNI/Thumb 1 (Bl 207127, BMS-791325) 


496 


P 


P 


A,S,T 


NNI/Thumb 1 (Benzimidazole derivatives) 



*[6,9,1 1,15-18,54,55]. 

**L320F when in combination with LI 59F confers low level resistance to MCB [9], L320I or V321 1 when in combination with C223H confers low level resistance to GS-938 
[54]. 

doi:l 0.1 371 /journal.pone.01 05569.t002 



NV19865, PP22205, NP22660, NV21075 and PP25213, repre- 
senting >90% of the isolates, remaining isolates being purchased 
from the American Red Cross) [27-32], with a median viral load 
of 3.6x10*^ lU/mL (range 7.6xl0*-5.9xl0' lU/mL) were 
studied. All studies were global and conducted in fuU conformance 
with the principles of the Declaration of Helsinki and Good 
Clinical Practice. Protocols and all amendments were reviewed 
and approved by local ethics committees and regulatory author- 
ities. Written informed consent was obtained from all patients 
before any study-related activities occurred. The authors were not 
involved in any of the original sample collections and samples were 
de-identified prior to being accessed by the authors. 

HCV RNA was extracted from 400 jxl to 800 ^tl of serum from 
HCV-infected patients using the ZR Whole-blood Total RNA kit 
(Zymo Research, Irving, CA, USA) and following the manufac- 
turer's instructions. 

Population-based dideoxy-terminator (Sanger) 
sequencing 

Population sequencing spanning the NS5B and NS3/ 4A coding 
regions was performed by Sanger sequencing using primers 
covering both DNA strands using an ABI 3 7 30 xl DNA Analyzer. 
Chromatograms were analyzed using Sequencher (Gene Codes 
Corporation, Ann Arbor, MI, USA) and Vector NTI (Life 
Technologies, Carlsbad, CA, USA) software. Population sequence 
of each sample was then used for the 454 reads analysis (see section 
454 sequence analysis). 



HCV amplification for 454 sequencing 

Reverse transcription (RT) was performed using the Transcrip- 
tor High Fidelity cDNA Synthesis Kit (Roche Diagnostics, 
Indianapolis, IN, USA) following the manufacturer's instructions, 
using random hexamers as primers for initiation and 1 to 9.15 |.lL 
of HCV RNA (depending on sample availability) per reaction. 
The RT cycling conditions were as follow: 5 minutes (min) at 
25°C, 30 min at 50°C, 5 min at 85°C. For clinical isolates 
obtained from patients with an HCV viral load <3.4xl0' lU/ 
mL, cDNAs were pooled (between 2 to 6 RT reactions, depending 
on sample availability) and concentrated using the Zymo Research 
DNA Clean & Concentrator-5 (Zymo Research, Irvine, CA, USA) 
following the manufacturer instruction. 

Amplification of NS5B region (generating a 985 nucleotide 
fragment for G 1 a samples and a 9 1 6 nucleotide fragment for G 1 b 
samples) and NS3 region (generating a 750 nucleotide fragment 
for G 1 a samples and an 806 nucleotide fragment for G 1 b samples) 
was carried out in duplicate using the FastStart High Fidelity PCR 
System (Roche Diagnostics, Indianapolis, IN, USA) as follows: 
4 \xL cDNA (or 2 |J.l concentrated cDNA) was included in a 50 \xL 
reaction mixture containing 5 |J,L Buffer ^2 (IX final with 
1.8 mM MgClz), 1 |XL dNTPs (0.2 mM final each), 1 nL each 
forward and reverse primers (0.4 |a,M final each) and 0.5 |J.L 
enzyme (2.5 Units). PCR reactions were performed as foUow: 
3 min at 94°C, then 30 cycles of 30 seconds (sec) at 94°C, 30 sec 
at 52°C, and 1 min at 72°C, followed by a final 7 min extension 
step at 72°C. The duplicate products of this first round PCR 
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(PCRl) were pooled and then amplified in a nested PGR (PCR2) 
using 454-fijsion primers containing custom 7-mer bar codes 
(designed to be able to pool amplified DNA from different patient 

samples for the UDPS reactions) and the same cycling conditions 
as PCRl. The PCR2 reaction produced 3 smaller overlapping 
amplicons (308-368 nucleotides) encompassing NS5B amino acids 
244 to 496 and 2 smaller overlapping amplicons (244—356 
nucleotides), encompassing NS3 amino acids 31 to 175 and 30 
to 190 for Gla and Gib isolates, respectively. 

The PCR2 products were double purified using AMPure beads 
(Beckman Coulter, Brea, CA, USA), quantified using Quant-iT 
PicoGreen dsDNA Reagent (Life Technologies, Carlsbad, CA, 
USA), pooled at equimolar concentrations and pyrosequenced 
using the 454/Roche GS FLX titanium platform producing reads 
of 331 bp on average. All amplification primers are given in 
Tables SI and S2 in File SI. 

454 sequence analysis 

Standard flowgram format (SFF) files, the raw output from 
UDPS, were processed to generate paired files containing FASTA 
sequence reads and Phred-equivalent quality scores for each 
sequence library. To reduce sequence artifacts, reads shorter than 
200 nucleotides and reads containing one or more bases with a 
quality score of <10 (>10% probability of sequence error) or a 
mean quality score <25 (>0.3% probability of sequence error per 
base per sequence read) were excludc-d. Sequenced reads were de- 
multiplexed using the 5' primer and barcode sequences, resulting 
in the assignment of each read to a patient sample and primer- 
pair, and each UDPS read was aligned to the population-based 
sequence of each sample using the MosaikAligner program 
(http://bioinformatics.bc.edu). For the calling of nucleotide 
mutations and amino acid substitutions, each amplicon was 
compared to the corresponding subtype reference sequence, H77 
for Gla (GenBank accession number M67463) and Con I for Gib 
(GenBank accession number AJ238799). High-abundance variants 
were defined as mutations present in >25% of the sequence reads 
(as is conventionally defined for the Sanger sequencing detection 
threshold), and low-abundance variants were defined as mutations 
present in <25% of the sequence reads. 

AU sequence data have been deposited at the NCBI Sequence 
Read Archive under study accession SRP040802. 

The technical error rate was estimated for each UDPS run by 
amplifying and sequencing two HCV reference plasmids, Gla 
H77 and Gib Conl using the same PGR protocol described for 
the clinical isolates. Each UDPS read was aligned to the 
corresponding plasmid sequence using the MosaikAligner pro- 
gram (http://bioinformatics.bc.edu) and the number of mismatch- 
es was counted. Homopolymeric regions were defined as regions 
with three or more identical consecutive nucleotides and their 
flanking nucleotides [33]. 

Distribution of NS3 and NS5B quasispecies variants 

The number and frequency of distinct virus variants (sometimes 
referred to as "haplotypes") in each sample was estimated using 
the computational method ShoRAH (Short Reads Assemby into 
Haplotypes), which incorporates serjuencc error calculations to 
avoid overestimating the number of distinct genetic variants in a 
sample. [34] After removing virus variants present at frequency 
lower than 0.5%, the intra-sample quasispecies Shannon entropy, 
SQS(nt)> was calculated: [21]. 

"Pihipi 



where pi is the frequency of sequence variant i, n is the total 
number of variants and N is the average number of reads per 
amplicon. Each distinct variant identified by ShoRAH was 
translated to its corresponding amino acid sequences. Variants 
with same amino acid sequences were pooled and the Shannon 
entropy for the amino acid sequences (SQS(aa)) was calculated in a 
manner analogous to SQS(nt)- Shannon entropy was chosen to 
characterize the mutant spectra of viral quasispecies in our 
samples, because it is a simple normalized quantitative measure of 
variability (randomness) in a sequence dataset which incorporates 
both the frerjuency and the number of possible sequence variants 
into a single metric. In the context of quasispecies analysis, a 
higher Shannon entropy (e.g. 0.25) typically implies that a virus 
population consists of a large number of distinct sequence variants 
(e.g. ~15 or more) occurring at low to moderate frequency, 
whereas a relatively low Shannon entropy (e.g. 0.02) often 
indicates a more ordered virus population which may contain 
just one to a few (e.g. ~3) sequence variants occurring at 
significant frequency [21]. 

Statistical inference 

The Mann-Whitney-WUcoxon test was used to assess the 
statistical significance of differences between the median Shannon 
entropy between subtypes and genes. The Fisher's exact test was 
used to determine if the levels of amino-acid conservation in the 
NS3 and NS5B regions differed between Gla and Gib isolates by 
using the fisher. test function for count data, as implemented 
in the R statistical package. 

NS3 and NS5B drug-resistant amino acid substitutions 

included in the study 

Study-defined NS3 and NS5B DAA-resistant amino acid 
substitutions are listed in Tables 1 and 2. 

Results 

UDPS coverage and technical error rate according to 

gene and genotype 

UDPS yielded a total of 3.5 million high quality sequence reads 
of 200 or more nucleotides from 136 Gla and 55 Gib NS3 
samples and from 77 Gla and 39 Gib NS5B samples. Overall, 
tiiere was a median of 8,399 reads (IQR: 6,943-11,151) for tiie 
191 NS3 samples and a median of 14,043 reads (IQR: 11347- 
16142) for the 116 NS5B samples. A total of 34 HCV plasmid 
controls were run (10 Gla and 8 Gib NS3 controls, and 10 Gla 
and 6 Gib NS5B controls) and a total of 339,769 sequenced reads 
with >200 bases were generated over 1 1 plates for the 34 controls: 
105,210 for Gla NS3 and 80,462 for Gla NS5B; and 75,182 for 
Gib NS3 and 78,915 for Gib NS5B. 

The median mismatch error rate (or technical error rate), 
determined using Gla H77 and Gib Conl plasmid DNA, was 
7.0x10"* overaU: 4.0x10"* in non-homopolymeric regions and 
1.4x10 ^ in homopolymeric regions. The median mismatch error 
rate was the same for the NS3 and NS5B genes and for Gla and 
Gib plasmids. Assuming that errors occurred in a Poisson 
distribution and that samples contained one thousand viral 
templates, and adding the published error rate of the Roche 
Transcriptor High Fidelity reverse transcriptase (1.98x10 '), the 
likelihood that technical artefact would cause a mutation to be 
detected at a level of 0.5% or higher would be 7.7 x 10 ''in non- 
homopolymeric regions and 1.5x10 ^ in homopolymeric regions. 
The likelihood that technical artefact could cause a mutation to be 
detected at a level of 1.0% or higher would be <1.0xlO '° in 
non-homopolymeric regions and 2.5x10 ^ in homopolymeric 
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regions. Using tlie overall median error rate of 7.2x10" and a 
Bonferroni correction for testing 13 Pl-resistant variants, 3 NI- 
resistant variants, and 1 8-NNI-resistant variants, the likelihood of 
detecting any PI-, NI-, or NNI-resistance mutation at a level of 
0.5% or higher in a sample would be 1.2x10 ^, 2.7x10 and 
1.6x10"^, respectively. To prevent false positives, a conservative 
cut-off (SO.5%) for variant detection was adopted as the minimum 
threshold in calling high- and low-abundance mutations. 

Distributions of quasispecies diversity among Gla and 
Gib isolates 

To evaluate inter-patient genetic variation across the NS3 and 
NS5B quasispecies obtained by UDPS, the histograms of Shannon 
entropy of quasispecies (see Materials and Methods for definition) 
were plotted for all samples combined (Gla and Gib), or 
separately by subtype, in nucleotides (Figure lA) and in amino 
acids (Figure IB). 

The median Shannon entropy for the NS3 nucleotide sequences 
was similar for the 136 Gla (0.25, Inter Quantile Range [IQR]: 
0.19 to 0.29) and 55 Gib (0.26. IQR: 0.23 to 0.29) samples 
(p = 0.1; Mann Whitney Test) (Figure lA). The median Shannon 
entropy for the NS5B nucleotide sequences was similar for the 77 
Gla (0.28, IQR: 0.23 to 0.31) and 39 Gib (0.28, IQR: 0.22 to 
0.31) samples (p = 0.3; Mann Whitney Test) (Figure lA). 

The median Shannon entropy for the translated NS3 sequences 
was similar for the 136 Gla (0.09, IQR: 0.04 to 0.12) and 55 Gib 
(0.09, IQR: 0.06 to 0.13) samples (p = 0.2) (Figure IB) but it was 
higher for the translated NS5B sequences for the 77 Gla (0.19, 
IQR: 0.14 to 0.24) compared widi the 39 Gib (0.13, IQR: 0.09 to 
0.17) samples (p<0.001) (Figure IB). 

The amino acid sequences showed less variability (i.e. lower 
Shannon entropy) than the nucleotide sequences (SQS(i,a): ~0 to 
0.24 vs SQS{ut): ~0 to 0.31, zero-th to 75th percentile. Figures lA 
& B). Both Gla and Gib virus populations exhibit Shannon 
entropy ranges spanning S~0 and S~0.3 (Figures lA & B), 
consistent with the notion that sequence diversity in HCV patient 
samples can vary among samples. 



Prevalence of high-abundance NS3 resistance amino acid 

substitutions (present in >25% of UDPS reads) 

Among the 136 Gla NS3 isolates, 48 (33%) of the 145 
sequenced residues had one or more high-abundance amino acid 
substitutions, whereas the remaining 97 (67%) sequenced residues 
had no high-abundance variants (Table 3). Among the 55 Gib 
NS3 isolates, 33 (23%) of the 145 serjuenced residues had one or 
more high-abundance amino acid substitutions, whereas the 
remaining 112 (77%) of sequenced residues had no high- 
abundance variants (Table 3). 

The polymorphic Pl-resistant substitution Q80K, which is 
associated with reduced low-level susceptibility to the macrocyclic 
Pis simeprevir and danoprevir, was present in high abundance (in 
30 to 100% of the reads) in 30 (22%) of the 136 Gla isolates but in 
none of the 55 Gib isolates (p<0.001; Fisher's Exact Test). 
Additional high-abundance Pl-resistant amino acid substitutions in 
Gla samples were found (in 41% to 99.8% of the reads), including 
V36M (n = 1), T54S (n = 4), V55A (n = 2) and V55I (n = 3), which 
confer resistance to linear Pis, and Q80R (n=3) and D168E 
(n=l), which confer resistance to macrocyclic Pis. One Gib 
sample contained the Pl-resistant variant T54S (n = 1, in 99.6% of 
the reads). OveraU (including Q80K), Gla isolates (44/136; 32%) 
were more likely to contain one or more high abundance amino 
acid substitutions than were Gib isolates (1/55, 2%; p<0.001; 
Fisher's Exact Test). Excluding Q80K, the difference in the 
proportions of high-abundance simino acid substitutions between 
Gla (14/136, 10.3%) and Gib (1/55; p = 0.07, Fisher's Exact 
Test) was smaller. In all but one Gla isolates, a Leucine residue 
was present in NS3 position 175. This amino acid variant was not 
included in our analysis of potential resistance-associated variants, 
because it has only been associated with resistance to boceprevir in 
the Gib context. 
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Figure 1 . Histograms of baseline quasispecies diversity in the study population, as represented by Shannon entropy in nucleotides 
(A) and in amino acids (B). Sequence variants were assembled using ShoRAH (as described in material and methods) and only variants with 

frequencies 2:0.5% were included. 
doi:l 0.1 371/journal.pone.01 05569.g001 
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Prevalence of low-abundance NS3 resistance amino acid 
substitutions (present in <25% of UDPS reads) 

Among the 136 NS3 Gla isolates, 20 (15%) had one or more 
low-abundance Pl-resistant amino acid substitutions (in 0.5% to 
15.2% of the reads) associated with reduced susceptibility to Pis: 
V36A (n= 1), V55A (n = 2), V55I (n= 1), Q80K (n = 2), Q80R 
(n = 4), R155K (n=l), A156G (n=l), A156T (n= 1), V158I 
(n=l), D168N (n=l), D168V (n= 1), I170T (n = 2) and dual 
amino acid substitutions Q80R/I170T (n=l) and R155K/ 
D168G (n= 1) (Table 4). 

Among the 55 Gib isolates, five (9%) contained low-abundance 
Pl-resistant amino acid substitutions (in 0.6% to 3.7% of reads) 
associated with reduced susceptibility to Pis: V36A (n = 1), V170A 
(n= 1), V170T (n= 1), and M175L (n= 1) and dual amino acid 
substitutions D168E/M175L (n=l) (Table 4). There was no 
significant difiFerence in the prevalence of low-abundance Pl- 
resistant amino acid substitutions between Gla (15%, 20/136) and 
Gib isolates (9%, 5/55; p = 0.4, Fisher's Exact Test). 

Low-abundance variants at amino acid positions associated with 
PI resistance, but containing amino acid substitutions not 
previously described as conferring resistance to Pis, were present 
at low prevalence (Q41H (n = 2), F43L (n = 7), V55N (n= 1), 



Q80L (n = 5), S138P (n = 5), V158A/G/M (n = 3), in Gla isolates; 
and F43L (n= 1), Q80L (n = 5), V170M (n= 1) in Gib isolates) 
(data not shown). 

Prevalence of high-abundance NS5B resistance amino 
acid substitutions (present in >25% of UDPS reads) 

Amino acid substitutions in NS5B detected in ^25% of UDPS 
reads after translation of the 7 7 G 1 a and 39 G 1 b isolates are listed 
in Table 5. Of the 253 sequenced NS5B amino acid residues, 179 
(71%) did not show a substitution present in £25% of the UDPS 
reads in any of the isolates. The remaining 74 NS5B amino acid 
residues (29%) had one or more substitution present in a25% of 
the UDPS reads (Table 5). Among them, NNTresistance amino 
acid substitutions were present in 21 Gla isolates (M414V [n= 1] 
and C445F [n= 1] in the Palm I domain, A421V [n= 15] in the 
Thumb I domain, I482L [n = 1] in the Thumb II domain and dual 
amino acid substitutions A421V/M423I [n = 2] and A421V/ 
M423V [n=l]) and 10 Gib isolates (C316N [n = 3], M414L 
[n= 1],M414T [n= 1] and dual amino acid substitutions C316N/ 
M414L [n = 1] in Palm I domain and A42 IV [n = 3] and P496A 
[n= 1] in the Thumb I domain), all present in 72.2% to 100% of 
the reads. No Nl-resistant variants were detected. There was no 



Table 4. Low-abundance NS3 Pl-resistant amino acid substitutions found in 20/136 Gla and in 5/55 Gib treatment-naTve HCV 
samples. 



Sample ID 


HCV Genotype 


Low-abundance PI 
resistance^ 


Class of Protease 
inhibitor 


Pt 4 


1a 


R155K (0.6%= 14/2301) 


Linear and Macrocyclic 


Pt 7 


1a 


A156T (0.8% = 20/2281) 


Linear and Macrocyclic 


Pt 40 


la 


11701 (0.6% =17/2575) 


Linear 


Pt 42 


la 


Q80K (15.1% = 8490/55864) 


Macrocyclic 


Pt 50 


la 


Q80R (67% = 826/1 2 154) 


Macrocyclic 


Pt 52 


la 


D168V (0.9% = 48/5047) 


Macrocyclic 


Pt 54 


la 


V55A (0.5% = 15/2993) 


Linear 


Pt 59 


la 


V55I (15.2% = 543/3559) 


Linear 


Pt 60 


la 


V55A (0.5% = 24/4664) 


Linear 


Pt 63 


la 


Q80R (0.6% = 43/6945) 


Macrocyclic 


Pt 65 


la 


A156G (3.5%= 123/3512) 


Linear and Macrocyclic 


Pt 67 


la 


D168N (3.8% =170/4390) 


Macrocyclic 


Pt 68 


la 


V36A (1 .3% = 39/2795) 


Linear 


Pt 70 


la 


I170T (0.8% = 51/6061) 


Linear 


Pt 71 


la 


VI 581 (0.5% = 26/481 3) 


Linear 


Pt 73 


la 


Q80R (17% = 331/18914) 


Macrocyclic 


Pt 74 


la 


Q80K (1% = 85/8202) 


Macrocyclic 


Pt 76 


la 


Q80R (0.5% = 59/1 1582) 


Macrocyclic 


Pt 77 


la 


Q80R (0.8% =189/21 037), 
I170T (0.7% = 62/8663) 


Linear and Macrocyclic 


Pt 81 


la 


R155K (0.5% = 31/5635), 
D168G (0.5% = 30/561 2) 


Linear and Macrocyclic 


Pt 33 


lb 


D168E (2.7% =129/4688), 
M175L (3.7% = 1 76/4687) 


Linear and Macrocyclic 


Pt 82 


lb 


V36A (0.6%= 13/2133) 


Linear 


Pt 83 


lb 


V170T (1.3% = 34/2504) 


Linear 


Pt 84 


lb 


/VII 75L (0.6% = 48/6974) 


Linear 


Pt 85 


lb 


V170A (0.7% =12/1 683) 


Linear 



^In parenthesis, abundance of the variant is indicated (in % and number of reads in which the variant is detected over the number of total reads). 
doi:1 0.1 371/journal.pone.01 05569.t004 
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significant difiFerence in the prevalence of high-abundance NNI- 
resistant variants between subtypes Gla and Gib (21/77 vs 10/ 
39; p>0.1, Fisher's Exact test). 

Prevalence of low-abundance NS5B resistance amino 
acid substitutions (present in <25% of UDPS reads) 

Low-abundance NNI-resistant variants were present in 29 
(38%) of the 77 Gla and 12 (31%) of the 39 Gib samples (p = 0.5; 
Fisher's Exact test). The 29 Gla samples contained the following 
NNI-resistant amino acid substitutions in 0.5% to 20% of reads: 
C316Y (n=l), M414T (n=l), M414V (n=l), C445Y (n=l), 
Y448C (n = 1), Y448H (n = 4), Y452H (n = 1) and dual amino acid 
substitutions M414T/M414V (n=l) in the Palm I domain; 
A421V (n= 10) in the Thumb I domain; M423I/I482T (n= 1), 
V494A (n = 2), V494I (n = 1) in the Thumb II domain, and dual 
amino acid substitutions of C316Y/A421V (n=l), M414T/ 
V494A (n=l), A421V/M426T (n=l), M423T/Y452H (n=l) 
(Table 6). The GS-938-NI-resistant variant V321I was present in 
0.7% of reads of one Gla sample. 

The 12 Gib samples contained the following NNI-resistant 
amino acid substitutions in 0.6% to 2 1 .3% of reads: Y448C (n = 1), 
Y452H (n = 4) in the Palm I domain; A421V (n= 1) and P496S 
(n= 1) in the Thumb I domain; M426T (n= 1), V494I (n= 1) in 
the Thumb II domain and dual amino acid substitutions of 
M414I/I482L (n=l), A421V/Y448H (n=l), Y448H/V494I 
(n = 1) (Table 6). 

The Nl-resistant variants S282T and L320I/F were not present 
in any read from Gla or Gib samples. 

Most natural DAA-resistant variants detected involved 

the lowest genetic barrier (single transition) 

Overall, 75% (18/24) of the NI- and NNI-resistant amino acid 
substitutions detected in Gla or Gib isolates were attributed to a 
single transition, while 21% (5/24) were the results of a single 
transversion (Table 7). No observed NS5B substitution repre- 
sented a double transversion from wild type. Only 4% (1/24) 
involved 1 transition and 1 transversion (C316N). Similarly, 69% 
(13/19) of the Pl-resistant amino acid substitutions detected in 
Gla or Gib isolates were caused by a single transition whereas 
26% (5/19) were due to a single transversion, and only 5% (1/19) 
involved two transitions (Table 7). 

Discussion 

Resistance profiles of the first four approved DAAs agents (the 
protease inhibitors telaprevir, boceprevir and simeprevir and 
NS5B polymerase inhibitor Sofosbuvir) have been well character- 
ized and clinical resistance resulting in treatment failure has been 
reported for the first three. [1-3] It has recently been shown that 
virological response rates are higher and breakthrough rates are 
lower in Gib infected patients than in Gla infected patients 
treated with certain classes of DAAs such as Pis and Palm I NNIs 
[1,2,23-26]. 

As a result of a lack of a proof reading function in the HCV 
polymerase resulting in a high rate of spontaneous mutation, 
variants emerge constantly and, with a DAA selection pressure 
combined with a replicative advantage, may quickly Ix-come 
dominant in the population. It is not known if a higher level of 
genetic variability exists in Gla than Gib, or if elevated variability 
may be associated with treatment response to DAA's. A thorough 
characterization of the genetic diversity and prevalence of DAA- 
resistant variants in treatment-naive HCV infected patients will 
therefore shed light on their potential clinical relevance and 
impact. 



In this study, we tested the hypothesis of whether intrinsic 
genetic variability across Gl subtypes is directiy associated with 
the differential response rates between Gla- and Gib-infected 
patients treated with DAAs. For this, we performed an extensive 
UDPS analysis that included 191 NS3 and 1 16 NS5B isolates from 
208 HCV-infected DAA-naive patients infected by a Gla or Gib 
virus with viral load of the patients spanning almost three orders of 
magnitude (see Materials and Methods). To our knowledge, this is 
the largest UDPS study on HCV diversity in treatment-naive 
patients up to now. 

We observed a higher prevalence of high-abundance variants 
containing a Pi-resistant variant/polymorphism in Gla than Gib 
isolates. However, this higher prevalence is mainly driven by a 
polymorphism at position 80 that, in the case of simeprevir, pre- 
existence of Q80K polymorphism has been shown to affect the 
sustained virologic response (SVR) rate. [3] Amino acid substitu- 
tion at Q80 have not been identified in telaprevir or boceprevir 
clinical trials [1,2] and have no significant effects on the activity of 
telaprevir or boceprevir in in vitro experiments. [35] Low- 
abundance Pl-resistant variants were also identified in both Gla 
and Gib isolates, however no significant difference between Gla 
and Gib was observed. The resistance mutations V36M and 
R155K in Gib isolates and V170A in Gla isolates have higher 
genetic barriers and require two nucleotide changes from wild- 
type (Table 7). Consistent with this phenomenon, we found no 
pre-existing V36M and R155K mutations among Gib isolates and 
no V170A mutations among Gla isolates in the set of high- and 
low-abundance mutations from this study. 

The impact of low-level pre-existing Pl-resistant variants on 
treatment outcome is yet to be fully determined. Recentiy 
published studies using UDPS methods and following a small 
number of Pi-treated patients have shown that, in some (but not 
all) patients, pre-existing low-level Pl-resistant variants could have 
affected their response to treatment [36] or re-treatment [37]; the 
presence of Pl-resistant variants at basehne did not necessarily 
prevent a patient from responding to treatment and Pl-resistant 
variants at basefine were not selectively enriched upon treatment. 
[29] Our findings support the notion that the relationships 
between pre-existing Pl-resistant variants and treatment outcome 
are likely to be complex and may depend on virus genotype, 
genetic barriers of a particular resistant variant as well as the 
prevalence of the variant, among other factors. 

In contrast to the NS3 region, no significant difference between 
Gla and Gib isolates was observed in the prevalence of high- 
abundance drug-resistant variants in the allosteric binding sites of 
the NS5B region. The results obtained here are also in agreement 
with previous studies that showed an overall 10 to 20% prevalence 
of variants at known drug resistance sites in the NS5B sequences 
from treatment-naive individuals. [38,39] Low-abundance drug- 
resistant variants in the allosteric binding sites of the NS5B region 
were detected in ~20 to 30% of the isolates, with no difference 
between Gla and Gib isolates. 

The impact of NNI resistance variants abundance on drug 
susceptibiUty was demonstrated in an in vitro phenotypic study, 
showing that NNI-resistance variants present >25% decrease 
significanfly the NNI potency. On the other hand, NNI-resistance 
variants present<25% did not necessarily have an impact on NNI 
potency [38]. 

This study also looked at the potential pre-existence of 
Sofosbuvir or Mericitabine resistance mutations. The NS5B 
nucleos(t)ide inhibitor-resistant variant S282T, was not found. 
To date, no S282T variant, has been found in treatment-naive GT 
1 isolates above the assay detection limit of sensitive sequencing 
technology such as UDPS, as shown in this study (n = 1 16) and in a 
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Table 6. Low-abundance NS5B drug-resistant variants found in 


30/77 Gla and In 12/39 Gib treatment-naive HCV samples. 




Sample ID 


HCV GT 


Low~abunclance 

Nl and NNI resistance^ 


Class of Polymerase 
inhibitor 


Pt 1 


la 


M414T (4.2% = 1 69/3979) 


Palm 1 


Pt 2 


la 


C316Y (0.8% = 53/6455) 


Palm 1 


Pt3 


la 


Y448H (0.6% =18/2607) 


Palm 1 


Pt 4 


la 


V494A (2.6% = 96/3691) 


Thumb II 


Pt 5 


la 


M414T (0.6% =19/2781), 
M414V (0.6%= 17/2781) 


Palm 1 


Pt 6 


la 


C316Y (0.6% = 52/7859), 
A421V (1.4% = 34/2406) 


Palm 1; Thumb 1 


Pt 7 


la 


M414T (0.5% =13/2595), 
V494A (1.9%=47/2381) 


Palm 1; Thumb II 


Pt 8 


la 


Y448H (1.1% = 32/2669) 


Palm 1 


Pt9 


la 


M423I (0.5% =13/2277), 
I482T (0.9% = 24/2564), 


Thumb II 


Pt 10 


la 


A421V (1.8% = 78/4256), 
M426T (0.6% = 28/4296) 


Thumb 1; Thumb II 


Pt 11 


la 


M423T (1.2% = 93/7606), 
Y452H (0.8% = 72/841 6) 


Thumb II; Palm 1 


Pt 12 


la 


C445Y (2.9% = 89/301 3) 


Palm 1 


Pt 15 


la 


V321I (0.7% = 56/7773) 


Nucleoside Inhibitor 


Pt 17 


la 


Y448H (0.7% = 42/5676) 


Palm 1 


Pt 18 


la 


Y452H (0.5%= 16/3012) 


Palm 1 


Pt 19 


la 


Y448H (1.1% = 38/3402) 


Palm 1 


Pt 20 


la 


V494A (1.7% = 74/4258) 


Thumb II 


Pt 21 


la 


Y448C (0.5% = 31/5402) 


Palm 1 


Pt 39 


la 


A421V (1.6% = 108/6516) 


Thumb 1 


Pt 44 


la 


A421V (0.7% = 20/2704) 


Thumb 1 


Pt 46 


la 


A421V (2.3% = 53/2281) 


Thumb 1 


Pt 51 


la 


V494I (20% = 378/1 896) 


Thumb II 


Pt 52 


la 


A421V (0.7% = 36/4661) 


Thumb 1 


Pt 57 


la 


M414V (2%= 116/5626) 


Palm 1 


Pt 59 


la 


A421V (1.5% = 51/3203) 


Thumb 1 


Pt 93 


la 


A421V (4.3% = 220/5009) 


Thumb 1 


Pt99 


la 


A421V (1.2% = 23/1 873) 


Thumb 1 


Pt 118 


la 


A421V (1.4% = 33/2224) 


Thumb 1 


Pt 129 


la 


A421V (C.7% = 28/3510) 


Thumb 1 


Pt 142 


la 


A421V (0.9% = 30/3 117) 


Thumb 1 


Pt 23 


lb 


Y452H (14.3% = 91 2/6346) 


Palm 1 


Pt 26 


lb 


Y452H (2 1.3% = 897/4202) 


Palm 1 


Pt 27 


lb 


Y452H (0.7% = 43/5376) 


Palm 1 


Pt 29 


lb 


M426T (0.6% = 35/51 67) 


Thumb II 


Pt 31 


lb 


P496S (1.8% = 75/3974) 


Thumb 1 


Pt 32 


lb 


Y448C (0.7% = 43/5681) 


Palm 1 


Pt 33 


lb 


M414I (0.7% = 24/3045), 
I482L (4.5% = 1 50/3275), 


Palm 1; Thumb II 


Pt 34 


lb 


A421V (0.6% = 34/4890) 


Thumb 1 


Pt 35 


lb 


A421V (0.5% = 31/531 7), 
Y448H (0.7% = 38/5426) 


Thumb I; Palm 1 


Pt 37 


lb 


Y452H (1.4% = 75/5059) 


Palm 1 


Pt 38 


lb 


Y448H (0.7% = 20/2629), 
V494I (0.6%= 14/2191) 


Palm 1; Thumb II 


Pt 161 


lb 


V494I (l.l% = 42/3793) 


Thumb II 


^In parenthesis, abundance of the variant 
doi:l 0.1 371 /journal.pone.Ol 05569.t006 


is Indicated (in % and number of reads 


in which the mutation is detected over the number of total reads). 
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recent study. [40] A novel combination of NS5B substitutions 
(L159F/L320F) conferring a low-level resistance to mericitabine 
has been described recently. [9] Variants with L320F substitution 
were not observed, corroborating the absence of resistance to these 
compounds at low level in the treatment-naive individuals 
included in this study. It is noteworthy that the only NS5B NI- 
resistant substitution detected in this study, V321I, was low- 
abundance and involves a low genetic barrier of a single transition. 
On the other hand, the Nl-resistant substitution S282T and the 
Nl-resistant combination L159F/L320F, which were not ob- 
served, have higher genetic barriers and require a transversion and 
two nucleotide changes, respectively. For the NS5B allosteric 
inhibitors, the observed resistance variants were single transitions, 
single transversions or a combination of single transition and single 
transversion (Table 7), with the majority being single transitions. 

The quasispecies diversity of each isolate in each region and the 
inter-patient variation was also determined by the Shannon 
entropy at the nucleotide and amino acid levels. The only 
difference was seen between Gla and Gib isolates when 
comparing the Shannon entropy calculated for the NS5B amino 
acid sequences (0.19 vs 0.13; p<0.001; Mann- Whitney Test). 
When comparing the Shannon entropy between two patient 
groups, a statistical difference at the amino acid level was not 
always found to be correlated with a statistical difference at the 
nucleotide level (or vice versa), possibly due to codon degeneracy 
and other factors. For example, in a retrospective investigation of 
ten patients chronically infected with HCV G3a and treated with 
peginterferon/ ribavirin, Moreau et al reported that, at the baseline 



time point, the treatment failure group was found to have a higher 
Shannon entropy at the amino acid level (but not at the nucleotide 
level) than the sustained virological responders group [41]. 

In summary, no significant difference in median SQS(nt) levels 
was observed in this study that could diflFerentiate Gla and Gib 
quasispecies in either the NS3 or NS5B region. Similar results 
have been reported in a recent smaller UDPS study, based on 8 
Gla and 6 Gib samples. [36] It will be of future interest to 
investigate whether correlations can be established between 
Shannon entropy within the NS5B region at the amino acid level 
and treatment failure rates across clinical trials of Gla and Gib 
patients treated with NS5B NIs or NNIs. 

Despite the high genetic variability of HCV, only a third of the 
amino acid positions in NS3 (-23-33%) and NS5B (-29%) were 
found to contain high-abundance substitutions among aU isolates 
in this study. This apparent discrepancy can be attributed, at least 
partly, to the fact that synonymous substitution rates in HCV are 
typically > 10-fold higher than the non-synonymous substitution 
rates in the NS3 and NS5 regions [42], leading to many more 
nucleotide changes which do not alter amino acid identities. 
Additional constraints, such as the requirement to preserve the 3D 
structures of proteins and essential base-pairing in RNA, may have 
further reduced the number of "neutral" sites, where sequence 
change can be tolerated with no significant effects on virus fitness. 

High-throughput sequencing technology such as UDPS and 
lUumina deep sequencing has provided powerful new tools to 
characterize the genomes of pathogens such as HCV, and has 
been increasingly utilized to detect and quantify rare variants, 



Table 7. Contribution of each mutational category (transition and transversion) to the development of drug-resistant 
substitutions (variants) in the NS3 and NS5B proteins reported in this study. 







NS5B 




NS3 






G1a 


Gib 


G1a 


G1b 


Single Transition 

(GBS = 1) 


C316Y (2) 


M414I/T (1/1) 


V36A/M (1/1) 


V36A (1) 




V321I (1) 


A421V (5) 


V55A/I (4/4) 


V170A(1) 




M414T/V (3/3) 


M426T (1) 


Q80R (8) 


M175L (2) 




A421V (29) 


Y448C/H (1/2) 


R155K (2) 






M423I/T/V (3/1/1) 


Y452H (4) 


A156T (1) 






M426T (1) 


V494I (2) 


V158I (1) 






C445Y (1) 


P496S (1) 


D168G/N (1/1) 






Y448C/H (1/4) 




I170T (3) 




Y452H (2) 


I482T (1) 


V494A/I (3/1) 


Double Transitions 

(GBS = 2) 








V170T (1) 


Single transversion 

(GBS = 2.5) 


C445F (1) 


L392I (1) 


T54S (4) 


T54S (1) 




I482L (1) 


M414L (2) 


Q80K (32) 


D168E (1) 






I482L (1) 


A156G (1) 








P496A (1) 


D168E/V (1/1) 




1 transition + 
1 transversion 

(GBS = 3.5) 




C316N (4) 






The genetic barrier score (GBS) was calculated according to the model described by Van de Vijver et al.(50) A score of 1 is assigned to transitions (A<->G and C<-4T) and 
2.5 to transversions (A<-4C, A<->T G<->C and G<->T). Natural polymorphisms are indicated in italic. Numbers of isolates are indicated in parenthesis. 
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which are below the detection limits of Sanger-based techniques 
such as clonal sequencing [36,43-45]. However, the sensitive 
detection of rare variants requires the differentiation between 
actual variants and technical errors originating from library 
preparation and sequencing processes. By sequencing the NS3 and 
NS5B regions of two HCV reference plasmids (Gla H77 and Gib 
Conl), the error rates (either by HCV region or by HCV subtype) 
were found to be consistent with those described in the literature. 
[46-48] Subsequently, by combining the specific error rates of our 
UDPS methodology and a Poisson distribution, we were able to 
establish a conservative cut-ofF (>0.5%) which minimized false 
positive results in our HCV variants analysis. 

In conclusion, the study reported here provide a rich source of 
data on the abundance and prevalence of HCV variants in 
treatment-naive Gl patients, and provide insights into the possible 
factors contributing to the observed differences between Gla- and 
Gib-infected patients in virological response and breakthrough 
rates. Using the largest coUertion of UDPS data of HCV isolates 
from treatment-naive patients to date, we found that, at the 
genetic level, the variability was similar in Gla and Gib isolates 
and in both NS3 and NS5B regions. We observed no clear 
difference in entropy between Gla and lb HCV, at least in the 
HCV regions studied here; the number of naturally occurring 
high-abundance and low-abundance drug-resistant variants in 
NS5B was similar in Gla and Gib. Howc\c'r, a non-significant 
but higher prevalence of Pl-resistant low-abundance variants and 
a significantly higher prevalence of high-abundance Pl-resistant 
variants were observed in Gla than Gib NS3 samples. 
Importantly, this large-scale study strongly supports the elimina- 
tion of higher genetic variability in Gla as a major contributing 
factor to the observed difierences in virological response and 
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